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Outline 



• Ambiguity 

Extensions of CFG for parsing 

• Precedence declarations 

• Error handling 
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Grammar 



Ambiguity 



E — > E+E I E * E I (E) I id 

• String 

id * id + id 
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This string has two different parse trees for the same string 



Ambiguity 

(Cont.) 



E 

E + E 
E E id 

id id 



E 

E^*E 
id E + E 
id id 
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Ambiguity 

(Cont.) 



A grammar is ambiguous if it has more than one parse tree 

for some string distinct parse trees 

Equivalently, there is more than one right-most or 
left-most derivation for some string 

Ambiguity is BAD 

Leaves meaning of some programs ill-defined 

Leaves decisions about what the program means to 
the compiler 



Dr. Sherin ElGokhy 



Which of the following grammars are ambiguous? 



Quiz 



□ $ -» $S | a | b 

□ E->E + E | id 

□ S — > Sa | Sb 

□ E->E' | E' + E 
E'-> -E' | id | (E) 
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Dealing with 
Ambiguity 




There are several ways to handle ambiguity from a 
grammer 



Most direct method is to rewrite grammar 
unambiguously 



E E +EIE 

E' -> id * E' I id I (E) * E' I (E) 



Enforces precedence of * over + (priority) 
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Ambiguity in 
Arithmetic 
Expressions 



Recall the grammar 

EH>E + E|E*E|(E)|int 

The string int * int + int has two parse trees: 
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Ambiguity: 
The Dangling 
Else 




Considerthe grammar 

E — > if E then E 
| if E then E else E 
I OTHER 



This grammar is also ambiguous 
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The 

Dangling 

Else: 

Example 



• The expression 

if E, then if E 0 then E 0 else E, 

1 2 i 4 

has two parse trees 




• Typically we want the second form 



Dr. Sherin ElGokhy 



The 

Dangling 
Else: A Fix 



• else matches the closest then 

• We can describe this in the grammar 

MIF /* all then are matched with an else */ 
UIF /* some then is unmatched */ 

MIF — > if E then MIF else MIF 

| OTHER 
UIF — > if E then E 

| if Ethen MIFelseUlF 

Describes the same set of strings 
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The expression if E 1 then if E 2 then E 3 else E 4 



The 

Dangling 

Else: 

Example 

Revisited 






• A valid parse 
tree (for a 

UIF) 



Not valid because the 
then expression is not 



a MIF 
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Ambiguity 



No general techniques for handling ambiguity 

Impossible to convert automatically an ambiguous 
grammarto an unambiguous one 

Used with care, ambiguity can simplify the grammar 
Sometimes allows more natural definitions 
We need disambiguation mechanisms 
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Precedence 

and 

Associativity 

Declarations 



Instead of rewriting the grammar 

Use the more natural (ambiguous) grammar 
Along with disambiguating declarations 



Most tools allow precedence and associativity 
declarations to disambiguate grammars 



Examples ... 
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Choose the unambiguous version 

of the given ambiguous grammar: S->SS| a | b 



Ambiguity 

Quiz 



0 S -> S'a | S'b 
S'^ Sis 



S->S|S' 
S'-> a | b 



S->SS' 
S'-> a | b 



0 S^Sa|Sb 



Dr. Sherin ElGokhy 



Associativity 

Declarations 



* Considerthe grammar E — > E + E | int 

Ambiguous: two parse trees of int + int + int 

E 

E + E 
E + E int 
int int 

• Left associativity declaration: %left + 
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Considerthe grammar E — > E + E | E * E | int 
And the string int + int * int 



Precedence 

Declarations 




• Precedence declarations: %left + 



%left * 
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Error 

Handling 

Error kind 



Lexical 

Syntax 

Semantic 

Correctness 



Purpose of the compiler is 

To detect non-valid programs 

To translate the valid ones 
Many kinds of possible errors (e.g. in C) 



Example 




... int x; y = x(3); ... 

your favorite program 



betected by ... 

Lexer 

Parser 

Type checker 
Tester /User 
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Syntax 

Error 

Handling 



Error handler should 

• Report errors accurately and clearly 

• Recover from an error quickly 

Not slow down compilation of valid code 



Good error handling is not easy to achieve 
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Approaches 
to Syntax 
Error 
Recovery 



• From simple to complex 

• Panic mode 

• Error productions 

Automatic local or global correction 



Not all are supported by all parser generators 
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Simplest, most popular method 



Error 
Recovery: 
Panic Mode 



When an error is detected: 

Discard tokens until one with a clear role is found 
Continue from there 



Such tokens are called synchronizing tokens 

Typically the statement or expression 
terminators 
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Syntax Error 
Recovery: 
Panic Mode 
(Cont.) 



Considerthe erroneous expression 

(1 + + 2) + 3 

Panic-mode recovery: 

Skip ahead to next integer and then 
continue 



Bison: use the special terminal errorto describe 
how much input to skip 

E — > int | E + E | ( E ) | error int | ( error ) 
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Syntax 

Error 

Recovery: 

Error 

Productions 



Idea: specify in the grammar known common 
mistakes 



Essentially promotes common errors to 
alternative syntax 

Example: 

• Write 5 x instead of 5 * x 

Add the production E — > ... | E E 

Disadvantage 

• Complicates the grammar 
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Error 
Recovery: 
Local and 
Global 
Correction 



Idea: find a correct "nearby" program 

• Try token insertions and deletions 

• Exhaustive search 

Disadvantages: 

• Hard to implement 

Slows down parsing of correct programs 

"Nearby" is not necessarily "the intended" 
program 

• Not all tools support it 
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Syntax Error 
Recovery: 
Past and 
Present 



• Past 

Slow recompilation cycle (even once a day) 
Find as many errors in one cycle as possible 
Researchers could not let go of the topic 

• Present 

• Quick recompilation cycle 

• Users tend to correct one error/cycle 
Complex error recovery is less compelling 

• Panic-mode seems enough 
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I hanks 



